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‘ ABSTRACT 


The fault location model under inyestigation consists of an 
n-component series system known to have exactly one failed component. 
Component positions in the system are taken as fixed. A component is 
either working or failed. Components work or fail independently of 
each other, with theip a prionisreliabilatiés taken as given but not 
necessarily equal. Group testing to locate the failed component is 
sequential, binary and dichotomous in nature with certain results. 
The only costs are the number of tests made. The three solution 
procedures investigated are (1) a dynamic programning formulation, 
(2) a sequential halving procedure, and (3) a procedure based on 
information theory. The criteria for optimality are minimization of 
the expected number of tests required and minimization of the maximum 


number of tests required. 
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IT, THE GENERAL FAULT LOCATION PROBLEM 


A. PROBLEM STATEMENT 

The most general form of the fault location problem may be stated 
as follows: One is given a population consisting of individual 
elements. This set is partitioned into two disjoint and exhaustive 
subsets. The problem posed is to specify procedures designed to 
locate in an efficient manner those elements belonging to the first 
subset, 

The property which distinguishes those elements of the first subset 
is connected with some sort of deficiency or fault in most applications, 


hence the term "fault location problem", 


B. DISTINGUISHING MODEL CHARACTERISTICS 


The general fault location problem encompasses many specific models, 
1, Population.Size 
Model population size is either finite or countably infinite, 
The assumption of an infinite population can simplify calculations in 
many instances, 
2. ~Number of Defectives Known to Exist 
"Number of defectives" here is equivalent to the cardinality 
of the first subset in the general formulation. This number is generally 
taken at the outset of the problem to be either completely unknown, one 
or more, exactly one, or unknown but greater than some specified number. 
3. Probability of Elements Being Defective 
The a priori probability of being defective is given for each 


element. With no information as to the number of existing defectives 








each element is defective with this probability, independent of the 
State of other elements. Information on the number of existing defec- 
tives yields a posterior probability of being defective for each 
element given this information. These posterior probabilities are not 
independent, 
4, Test Outcomes 
A test is an action taken to identify elements belonging to 
the defective set. Tests may result in binary outcomes, i.e., pass/fail 
where these terms are appropriately defined; or they may result in 
multiple outcomes. 
5. Certainty of Test Results 
Test results may be certain, i1.e., indicate the true state of 


nature; or they may be uncertain, in which case probabilities of 


Paeaonaniis tect outcomes are shec fic 


[ee 


6. Number of Elements Included in a Single Test 


The number of elements included in a test may be one or more, 
the latter termed group testing. 
jee vest Costs 
Test costs are either assumed to be equal for all tests, or 
unequal and specified. Costs are measured in terms of the appropriate 


Scarce resource, e.g., time, money, etc. 
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IT. QVERVIEW OF WORK IN FAULT LOCATION PROBLEMS 


The following discussion of work which has been done in the general 
area of fault location problems is not claimed to be either an exhaus- 
tive enumeration of all such work or a summary of all the results 
achieved in the papers mentioned. Rather, it is intended to give the 
reader an appreciation for the scope of the general problem, a brief 
description of some of the work which has been done, and an idea of 
just where the model of this thesis fits. 

The first work in this area is generally attributed to Dorfman [7] 
in 1943. The problem he considered was involved with the requirement 
levied by World War IT on the Public Health Service and the Selective 
Service System to weed out syphilitic men from the large number of 
incoming inductees. Blood samples were tested using the "Wasserman-type" 
blood test wnicn detects the presence of a syphilitic antigen. Prior to 
Dorfman‘s efforts, blood samples were tensted on an individual basis, 
requiring a large effort on the part of laboratory personnel. Dorfman 
found that one could pool the blood samples from a number of men and 
test this pooled sample. If none of the men was infected, the entire 
group could thus be passed with a single test. The blood sample drawn 
from each man was sufficient to conduct two tests. The remaining blood 
for each member of an infected group could thus be tested individual ly 
to identify the infected member(s). Dorfman considered the population 
of inductees to be infinite. He established optimum group sizes and 
relative expected test costs (as opposed to individual sample testing) 
for selected a priori probabilities that any member of the population 
would be infected. This became known as the “blood testing problem". 


Dorfman's work showed the advantage of group testing to be inversely 





related to the a priori probability of infection, with the break-even 
point occurring at a probability of about 0.15. He considered the tests 
to be binary in nature with certain results. 

In 1960, Ungar [25] expanded on Dorfman and others' work on the 
blood testing problem. He considered the population to be of finite 
size n, each member with the a priori probability p of being infected. 
His results show that the range of p for which group testing is 
advantageous is 0 <p < 1/2(3 -V5) # 0.38. Ungar's group testing 
allows grouping of individuals for all tests. 

Finucan [8] in 1964 also extended the results of Dorfman in an 
algebraic treatment for Dorfman's two-stage method (group testing at 
the first stage, then individual testing at the second), and also for 
methods using three or more stages. As in Dorfman's work, Finucan 
assumes the nonulation to he infinite with each member hayine an ecaual 
prior probability of being infected. As in Dorfman and Ungar, minimi- 
Zation of the expected number of tests required is the criterion adopted 
for optimality. 

Sobel and Grol] [21] in 1959 also extended Dorfman's original efforts. 
In their model the population size is finite, the number of elements in 
each group test is not necessarily constant, and if a group test fails, 
each element is not necessarily tested individually. Several procedures 
are deyeloped with conditions for their optimality under ranges of Pp 
values, The criterion used is the expected number of tests required to 
eliminate all defectives. Several industrial applications are also 
cited. This work was extended by Sobel [20,23] in 1960 and 1964. Kumar 
and Sobel [24] considered a variant of this model in 1970. They begin 
with an infinite population, each of whose elements has an equal prior 


probability of being defective. In this model an optimum group testing 








procedure is obtained in the sense that it minimizes the expected 
number of tests required to find the first defective. 

Black [1] in 1965 formulated a search model in which the finite 
population consists of "regions" in which a single target may lie. A 
test consists of a "look" in a region for the target. The binary 
results of these tests are uncertain in that each region is assigned 
Ms» a conditional miss probability, i.e., probability that given the 
target is in the ah region, it will not be detected on a single look 
there. Each region is also assigned a prior probability that the 
target is there (p.), and a cost per look (c.). His solution requires 
the computation of a figure of merit, p,(1-m.)m""/c,, for each of the 
i regions and for as many of n looks as desired. These values are 
placed in a two-dimensional array (by n and i). The order of search 
of the reaiane is in decreasing ardar of their fiauras of merit = Rlack 
also derives a means for calculation of the expected cost of the search. 
His model is equivalent to individual element testing with the cost of 
searching each region independent of the search effort previously 
expended in other regions. He references Matula [16], a student of 
Blackwell, who originally drew upon the results achieved by Blackwell [2] 
in his "ball-in-box" problem. Klein [5] in 1967 also formulated a 
search model involving a single target to be located, which employs 
Markovian decision models. 

Gaddess [6] addressed in 1969 a related problem for which very few 
results have been obtained, His work concerns the specification of test 
points within a modular combinational logic circuit with the objective 
of insuring diagnosability to the module level of the overall circuit, 


j.e., insuring that there exists at least one sequence of binary test 


ia 








inputs which distinguishes any two single faults in separate modules. 

He derives three techniques for allocating test points: (1) A method 
analogous to the prime implicant covering problem of classical switching 
theory, (2) a graphical worst case analysis, and (3) a combination of 
the first two methods. 

Zimmerman [26] in 1959 investigated a model in which the population 
is finite with exactly one defective element. Each element has a known 
(but not necessarily equal) probability of being defective. Testing is 
binary with certain results. Each test partitions the elements into 
two groups; the test outcome reveals which group contains the defective 
element. This group then comprises the elements for the next test. 
Testing is complete when the defective is found. Optimality is judged 


on the expected number of tests required. Zimmerman views the problem 
peewee cahed Comb teetion ed to rensatbed dig immer: 
His results involve the combination at any stage of the two elements 
having the smallest prior probabilities. These results appear to the 
author to be equivalent to a coding procedure developed by Huffman [12] 
iil ES aya 

Sandelius [19] in 1961 examined a particular case of the model 
previously treated by Zimmerman. Sandelius‘ model assumes that all 
elements have an equal prior probability of being defective. Testing 


is as described above for Zimmerman. To state Sandelius' theorem, we 


define the following: 
ale 2 ee 


number of elements to be tested 


where n 


m. number of elements remaining to be tested after 1 tests 


; 
have been conducted 


Wz 








kK. S LO liane 
re (Ose ne eae 


Sandelius proves the following theorem: When al] p, are equal the 
minimum expected number of tests jis kK + (2r/n) » and this will be 
attained if and only if the number of elements of each of the two 


th test (h = 0,1,2,...) belongs 


groups which are tested in the (h + 1) 
to the closed interval pakhol okhy, 

Johnson [18] in 1956 couched his models in terms of electronics 
equipment to be checked out. His population is finite and considered 
from two related levels. In the first, the population is composed of 
the electronic components which form the system. The second level 
population consists of the individual parts of which each component is 
composed. Sequential binary testing with certain results takes place 
on the components level first, then on the parts level of a located 
failed component. Qn each level elements are tested individually. 
Components and their associated parts are assigned prior probabilities 
of failure, and required test times (broken down further into the sum 
of the time for removal for testing, the time for the actual test, and 
the time for replacement). Optimum test sequences minimize the expected 
delay time, measured from the time the system is initially tested to the 
time it works. Johnson considers both multiple failures in the system, 
and the case of one failure. 

Gluss [10] in 1959 considered a model equivalent to Johnson's case 
of exactly one failure described above, with the exception that test 
results are no longer certain. Attached to each element is a conditional 


probability that a single test of that element will fail to indicate 








failure, given the element has actually failed. Firstman and Gluss [9] 
jn 1960 expanded Gluss' model to include "false alarms", i.e., tests 
which indicate failure when no failure has occurred. 

Butterworth [4] expanded the results of Johnson to examine specific 
types of systems. His models treat optimum sequential binary testing 
with certain results of individual components from a finite population, 
Each component's prior probability of failure and the time required to 
test it are known, The systems treated are the k-out-of-n type (k/n), 
7.e€., the system works if and only if k or more of its components 
work. The series (n/n) and parallel (1/n) systems are included as 
Special cases. In his first model a feasible testing procedure must 
determine the system state, assuming the components to be independent. 


In other models for which it is assumed that the system has failed, a 


Reds ible tecting proceduyro must locate el) failed components. Tasting 
procedures are judged in terms of the expected time required to complete 
testing. 

Brulé, Johnson and Kletsky [3] in 1960 investigated models phrased 
in the electronics system form. The individual components compose the 
finite population. They consider first a model allowing multiple 
component failures. Their goal is to construct a general testing diagram 
to show in node and arc form the successive system states, as represented 
by binary n-tuples, and the sequential tests to perform in locating the 
failed components, They conclude that even simple systems can require 
extremely complex testing diagrams, but that this requirement may be 
reduced to tractable proportions if the allowed number of failed 
components is reduced to exactly one. Their resulting testing diagram 


constructions are then based on this stipulation. They also examine 








sequential binary test procedures for two special cases. For both cases, 
testing is performed as described above for Zimmerman. The first 
special case, is termed the equal cost-equal probability case, in which 
the prior probabilities of failure and test costs for all components are 
equal. They cite a procedure termed the “half-split technique" which 
is equivalent to that described above in Sandelius‘ work, In this case, 
this procedure leads to an optimum solution in terms of minimizing the 
exptected cost of testing. For this special case, they claim that this 
procedure 1S a minimax solution, i1.e€., it minimizes the maximum possible 
cost. They further claim that it yields the minimax solution when the 
probabilities of failure are unequal. The author refutes both these 
claims, A counterexample is attached at the end of Appendix C. Their 
second special case is the equal cost-unequal probability case. They 
show an analogy between this situation and that described by Huffman [121 
in his optimum coding problem. To employ Huffman's procedure the compo- 
nents must be arrangeable in order of decreasing reliability, thus one 
must be able to group them in this manner under the testing procedure. 
Kletsky [15] in 1960 expanded on both the work mentioned above by 
Brulé, Johnson and himself and that of Johnson [13]. He shows an example 
of the application of a figure of merit based upon information theory in 
the formulation of a testing diagram corresponding to an "efficient" 
sequential test procedure. The example system is the power supply of a 
Standard USAF communications receiver. Estimation of posterior probabili- 
ties of failure given that the equipment has failed for each component is 
Shown through a frequency analysis of the equipment failure history. 
Implicit in these calculations is the assumption that any equipment 


failure results from exactly one failed component. The figure of merit 


iPS 





th 


F, representing the ratio of ambiguity removed by the k test to 


k 
the cost of performing the test, is as follows: 


-Plog. p - (1-P )1og,(1-P) 
bile a a Eee . 
k 
where P is the probability that the test will pass and C. is the cost 
of performing the test. Kletsky's procedure is as follows: 

1) Evaluate F, for each of the possible tests. 

2) Choose the test with the largest Fy. 

3) Alter the cost of performing the remaining tests on the basis of 
having performed this and other previous tests. 

4) Alter the probability of passing for each of the remaining tests 
on the basis of knowledge gained by having performed this and 
other previous tests. 

5) Repeat the procedure until the entire sequential testing diagram 
is determined, 

As an indication of the "efficiency" of the above-described procedure, 
Kletsky offers only the statement that "...the information theory tech- 
nique yields a diagnostic procedure which is not essentially different 
from that which would be used by a competent technician". 

Kovacs [11] in 1968 expanded on Kletsky's work, He derives an 
equivalent figure of merit for multiple outcome tests which reduces to 
the above form for binary tests. He also discusses the use of multiple 
outcome versus binary tests, and presents a method for the incorporation 
of multiple outcome tests on a binary test diagram. In evaluation of 
the information theory figure of merit approach, Kovacs states simply, 
",.,.this approach opens the door to...the most rapid and efficient 


checkout procedure," 








TIT. MODEL DESCRIPTION 


A. SYSTEM 

The population under investigation in this model can be described 
as a series system of a finite number of components. In terms of the 
system descriptions used by Butterworth [4], it is an n/n system in 
which the system works if and only if all its components work. If the 
system does not work, it 1s said to be failed. The system is most 
easily visualized as an electronic system with a single signal path, 


as shown in Figure 1. 


input <a. ——+| 9 -routput 


An n-Component Series System 
Figure 1 


B. COMPONENT ATTRIBUTES 

Component positions in the system are taken as fixed, such as in a 
series resistor network. As a convention, the system is as shown in 
Figure 1, with the input to the left and the components numbered sequen- 
mieuly from left to right. 

A component may be in one of two states, i1.e., working or failed. 
Components work or fail independently of each other. The a priori 
probability, p; for 1 = 1,2,...,n, that a component is working is taken 


as given, but not necessarily equal for all components. This probability 


17 








is sometimes called the reliability of the component, and may be 
expressed as the probability that the component will perform satis- 
factorily throughout the duration of some specified mission. The 
determination of these probabilities is not considered here. 


The system is assumed initially to have exactly one failed component. 


eee PESTING 

Testing is to be sequential, binary, and dichotomous in nature with 
certain results. It is dichotomous in that each test partitions the 
set of components known to contain the failed component into two subsets. 
It is sequential in that the outcome of any test will determine the next 
test to be conducted, thus the result of each test is evaluated prior to 
conducting the next test. Testing is binary in that only pass/fail 
results are obtained. The only costs are the number of tests made. We 
May Link of the tests being conducted as testing across a number of 
resistors for continuity using an ammeter with one probe of the ammeter 
fixed at the left end of the system. A test across the first two 
components is shown in Figure 2. If this test "passes", it indicates 
continuity through components | and 2, indicating that the failed 
component lies to the right of component 2. If the test "fails", it 
indicates no continuity through the first two components, thus one of 
them must be failed (since test results indicate the true state of 


nature). “Failed" in this context is equivalent to "open" in the 


Testing Across Components 1 and 2 


electrical sense. 





Figure 2 








Suppose the test in Figure 2 passes. Then the next test might be 
say as shown in Figure 3, where the set of components 3 through n is 
partitioned into sets consisting of components 3 through 5, and 6 
through n. If this test fails, the failed component is now known to 
be either component 3, 4, or 5; and if it passes, the failed component 


is now known to lie to the right of component 5. 





Testing Across Components 1-5 


Figure 3 


This sequential testing scheme requires a decision of where to place 
the right hand probe for each succeeding test, based upon the results of 
all previous tests. To specify any particular solution, these decisions 
must be specified for both pass and fail results for each test. The 
number of tests required to locate the failed component will be a 
random variable, depending on the test plan adopted and the component 
which has actually failed. We will let N represent this integer- 
valued random variable, and denote its expected value by E(N). 

Utilization of our testing procedure will result at any point in the 
Set known to contain the failed component being composed only of compo- 
nents adjacent to each other, such as components i through j, i <j. 
We shall use this fact to describe the system's state at any point in 
the testing procedure as follows: Let S(i,j) indicate that components 
1 through j inclusive comprise the set known to contain the failed 


component. Then let d(i,j) = k indicate the decision to test across 





a = 








the first k components when the system state S(i,j) is realized. 
Note that ke {i,it],...,j-]}. Further, if this test passes, we have 
S(k+1,j), and if it fails S(i,k). Testing is complete when we reach 
S(i,i) for some i, for then component ji is known to be failed. 

Diagrammatic representation of a derived test plan solution is best 
represented in the node-arc form as shown in Figure 4. In this repre- 
sentation, the rectangular nodes show sequential system states which 
may result throughout the procedure and their associated decision 
variable values. The circular nodes indicate the located failed 
component. This representation shows all possible paths through the 
procedure, each terminating with the location of the failed component. 
For example, suppose component three is actually the failed component. 
The test across the first four components fails, resulting in state 
meme), thea next test across the first three components alse fails, 
resulting in state S(1,3). The third test across the first two 
components passes, indicating component three to be failed (the terminal 
State $(3,3)). Thus, three tests are required under this procedure to 
locate the third component. 

For purposes of compactness, we wish to reduce the diagram of 
Figure 4 to tabular form. An equivalent procedure is depicted in 
Table I. The "TEST" and "STATE" columns are self-explanatory. The "DEC" 
column denotes the decision corresponding to the state to its left. The 
"COMP" column denotes the component(s) which, if failed, would be 
located by the test. This tabular form will be utilized to describe 


solutions to example cases. 
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1 fSs,10) 1 | l= Sieeonneaal| 

| d*(5,10)=6 : © || 4*(7,9)8 } & 
: Fail | pass fail | pass 
| LT s(s,6) | \s07,8) 
| ! | i] d*(7,8)=7 
| | | | fail 
! | | | @ 
7 | | | 

| | | | 

| | i | 

| | | 

: | | 

| | ! 

| : | 

| | : | 

| | | | 

| | | 

| | | | 

| | | | 

| STAGE 2 | STAGE 3 | STAGE 4 | STAGE 5 


sample Test Plan for 10-Component System 


Figure 4 
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TEST STATE DEC COMP 


1 1,10 4 

2 1, 4 3 4 
5,10 6 

3 1,3 2 3 
5, 6 5 5, 6 
7G 9 10 

4 1, 2 1 (ee: 
7, 9 8 9 

5 7, 8 7 7, 8 


Tabular Form of Test Plan 


TABLE I 


D. OBJECTIVE 

Qur objective is to examine several solution procedures based on 
Reeve. NS Crilerta Por crear: 

1) Minimization of the expected number of tests required. 

2) Minimization of the maximum number of tests required. 

Minimization of the expected number of tests required is the cri- 
terion most frequently encountered in the literature, Implicit in the 
use of such an expected value criterion is the assumption that testing 
is to be repetitious, yielding good results over the “long haul". In 
many cases, the results of any one realization of a system test may be 
disastrous in terms of the number of tests actually required. To 
hedge against this kind of disastrous realization, one may instead 
employ the criterion of minimizing the maximum number of tests required 
to locate the failed component. This is termed a "minimax" criterion. 

Of course, if one could devise a testing procedure which would be 


optimum in terms of both criteria, he might then consider the procedure 


a 











to be ideal. In general, however, one would expect that tradeoffs 
would exist between the two. In the practical sense then, one must 
examine the attributes of alternative solutions in order to choose the 
one which best suits his needs. 

We will examine three procedures. The first procedure involves 
the solution of a dynamic programming formulation of the problem. 
This results in a test plan which minimizes the expected number of 
tests required. 

The second procedure, termed the sequential halving procedure, 
is shown to minimize the maximum number of tests required. 

The third procedure, termed the information theory approach, is a 
heuristic procedure based upon maximizing the expected reduction in 
_the uncertainty of the failed component's location at each succeeding 


wmnnnciwnn ae wunt rnlaamnaA ¢n kA ARtamm Gh tree nf nsthan 
er Asta te eo we eau le K. © 07 Ceaperc ha tt.) av &, RACP A, ERIECALET bos Q-% 8 das a ae | N\- 8 teraele 
« 


criterion, yet its ease of computation suggests it to be worthy of 


investigation. 
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IV. MODEL SOLUTIONS 


A. DYNAMIC PROGRAMMING FORMULATION 


1. Nomenclature 


n = number of components which comprise the system. 

q. = probability that component i is failed, given that 
the system is failed with exactly one failed 
component. aD ons I. 

S(i.j) = system state denoting that the failed component is 
known to be one of the components i,itl,...,j-l,j. 
Ove Tidgt a s- 

Q. (455) = minimum expected number of tests required to locate 


the failed component beginning in state S(i,j) and 
testing across the first k components on the next 
ose. [liquemiirte eee 4 areas soll. 

(4,5) = I Qid) Ke iit, g-1. 

d(i.j) = k denotes the decision to test across the first k 


components on the next test, beginning in state S(i,j). 


deta.d) = the decision at S(i,j) which results in f(1,j), j-e., 
the optimal decision at S(i,j). 
ao uage 


We shall define the stage of the problem in terms of the system 
State. From the testing description in Section III, it follows that 
all possible system states can be enumerated in terms of the stage 
Wariable s as follows: S(i,its) for i = 1,2,...,n-s and s = 0,1,...,n-1. 
We shall let the stage correspond to the number s, so stages are 


numbered from zero to n=-1, and at each stage s the possible system 
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states are S(i,it+ts) for 7 = 1,2,...,n-s. For example, at stage zero 
the possible states are S(1,1),S(2,2),...,;S(n,n). At stage three the 
possible states are S(1,4), S(2,5),..., S(n-3,n). Finally, at stage 
n-1 the only possible state is S(1,n), which corresponds to the system 
state prior to any testing. The procedure to be employed is termed 
"backward recursive optimization" by Nemhauser [17]. Thus, while 
actual testing proceeds from stage n-1] to stage zero, solution of the 
problem begins with stage zero and works back to stage n-l. 

3. Decision Variable 

The decision variable at any state S(i,j) is 

See) = 11,it1,...,j-1}. 

4, Objective Function 


The objective is to minimize the expected number of tests 


e . . e e t a (oo | tal 317 
raninivad tn Tarnanatn thn Fant Tal nawrrnenen cnt ee 5 ite fogyetoos —atolew 
i te iiwn wv @wuu vue via iyi o OW Ot Codetrezieetee foe vith wee ote Le iia en ad al at Pt a 


—-—— — 


ts} 


Find f(1,n) = ca [Q,(1,n)] = min E(N), 
o» General Recursive Equation 
Let the system be in state S(t,m) at stage s =m- t. Then 


the general recursive equation is as follows: 
min 


f(t.m) =, [Q,(t.m)], k © tt,ttl,...,m-1}, 
where Q(t sm) is calculated as follows: 
Q.(t,m) = PLT + #(t,k)] + (1-P)[1 + f(kt1,m)] , (1) 
k 
2 44 
where p= 2% 
m 
» q 
ist 


Note here that P is the probability that state S(t,k) results from 


this test and [1 + f(t,k)] is the minimum expected number of tests 


as 








required for completion of testing should S(t,k) result. The second 
term in (1) is similarly interpreted. 
See initial Conditions 
As we have stated above, testing is complete when the system 
state is S(i,i) for some i , since we then know component ji to be 
mamed. Thus f(i,i) = 0 for i = 1,2,...,n. 
At state S(i,i+]) we have only to make one further test, thus 
emt) = 1, and d*(j,i+]) = 1 for i = 1,...,Med. 
7. Solution Procedure 
With the posterior probability of failure q. given for each 
of the system's hn components and the f values corresponding to all 
possible system states at stages zero and one given as shown above, 


begin with stage two. For each of the possible system states at stage 


emmegenes;s of the form S(i.4e2) forete= @ ee Calas 

Q, (i 142) for k = i1,i+1. For each state S(i,i+2), f(i,i4+2) = mg (iit2)1, 
where Q. (4 ,i#2) is calculated from (1). With each state S(i,i+2) 

associate the optimal decision variable d*(i,it2) which yields f(i,i+2). 

Continue similarly for subsequent stages. Upon completion of 
Stage n-1, a value for f(1,n) has been obtained. This is the minimum 
expected number of tests required to locate the failed component. 

For the complete specification of the derived test plan, trace 
back through the stages and their associated d* values along 
sequential testing paths which terminate with state S(i,i) for all i. 

8. Computer Program 

A computer program, written in FORTRAN IV for use on the 

IBM 360-67 digital computer has been developed and used in the example 


cases included in Section V. A program listing and sample output is 


included following Appendix C. 
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B. SEQUENTIAL HALVING PROCEDURE 

The sequential halving procedure is independent of the probabilities 
of failure attached to the system components. It can be described as 
follows: At any point in the procedure if m components remain to be 
tested, the next test should divide them as equally as possible. Thus, 
if m is even, the next test divides the components into two equal 
groups of m/2 each. If m is odd, the next test divides the components 
into two groups with m/2 - 1/2 in one group and m/2 + 1/2 in the 
other. For consistency, we will always take the smaller of the two 
groups to the left, e.g., if components 1-/ remain to be tested, we 
test next across components 1-3. 

It is shown in Appendix C that this procedure 1s minimax. In the 
Special case where the component probabilities of failure are all equal, 


ome 


mereemurOCrtuure Sauna 
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in Section II, and thus also yields the minimum expected number of 
tests required. In other cases, the expected number of tests required 


may not be minimized through this procedure. 


C. THE INFORMATION THEORY FORMULATION 

The information theory solution to the problem is a heuristic 
approach, based upon maximization of the expected reduction of uncertainty 
inherent in the system at each succeeding test. This procedure was 
mentioned in passing as a special case by Brulé, Johnson and Kletsky [22] 
in 1959. 

The procedure may be simply stated as follows: Suppose the failed 
component is known to belong to a set of m components. The component 
posterior probabilities of failure given that one of the m_ components 


m 
is failed, qa. are known with Y& q; = 1. Test next across the first 
1=1 








k components, choosing k_ such that | 


qi - Wie) is 


M1 we 


] 
a minimum, 


A derivation of an expression for the computation of component 
posterior probabilities is contained in Appendix A. A derivation of 
the information theory procedure and an algorithm for its use are 
contained in Appendix B. 

The attributes of the information theory solution in terms of the 
two criteria for optimality listed above is unclear, except to assert 
that the expected number of tests required will be at least that given 
by dynamic programming, and the maximum number of tests required will 
be at least that given by the sequential halving procedure. Further 
evaluation of this procedure is the purpose of the example cases 


considered in the next section. 








V. EXAMPLE CASE RESULTS 


Figures 5 through 19 summarize the salient features and results 


obtained in seventeen example cases. 


fey 10 FIGURES 5 THROUGH 19 


The following definitions apply to the entities of these figures: 


N = the number of tests required to locate the failed 
component. 

E(N) = the expected value of N. 

V(N) = the variance of N. 

a, = the maximum number of tests required to locate the 


failed component. 

0 = the 2 oyiowi relies] waver comcoment 7. 

q. = the posterior probability component i is failed, 
given that the system has one failed component. 

State = (i,j) denotes the set of components i through j 
inclusive is known to contain the failed component. 

Dec = 7 denotes the decision to test across the first 1 
components on this test. 

Comp column denotes the component(s) which, if failed, would be 
located by the test. 


pmf denotes the probability mass function values for N. 


Complete test plan specifications are shown for both the dynamic 


programming and information theory solutions for all 20-component cases. 


Cases 15, 16 and 17 are 40-component cases, for which only summary 


results are included. 
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B. SEQUENTIAL HALVING TEST PLAN SPECIFICATION 
The sequential halving test plan depends only on the number of 
components which comprise the system and is independent of the a priori 


component reliabilities assigned. 


TEST STATE DEC COMP 
] Ao 10 
2 Ws Me 5 
Misco ie 
3 es 2 
6,10 i 
PS 12 
16,20 ie 

4 1, 2 1 lige 

Cir S. 3 3 

One. 6 SA: 

SG 8 8 

laliect2 1) Wick 

eat 1s 1s 

lice? 16 Gaal 

L820 18 18 

5 4, 5 4 4, 2) 

9,10 9 9,10 

eoraleo 14 146 

19,20 19 Wehaalo 


20-Component Sequential Halving Test Plan 
TABLE II 


C. A PRIORI COMPONENT RELIABILITIES 

A priori component reliabilities do not necessarily sum to unity, 
and are therefore not probability distributions. 

Example case 1 is the 20-component equal probability case. 

Component reliabilities are linearly arranged for cases 2 through 5, 


in order of increasing slope. 


30 








Component reliabilities are symmetric and follow the familiar 
"bell-shaped" curve in cases 6, 7 and 8 in order of increasing 
peakedness. 

Cases 9, 10, 11, 15, 16 and 17 contain component reliabilities 
randomly selected between Q.7 and 1.0. 

Component reliabilities are geometrically arranged for cases 12, 
13 and 14 by Po0-( 4-1) = (p)' TOY 17152,.-4se0. Tiemweva lucsmieed 


pene. 95, 0.90 and 0.85 respectively. 


D. COMPUTER PROGRAM VALIDATION 

The computer program was validated first by running four and five- 
component cases for which the solution had been calculated manually. 

For 10, 20, 40 and 80-component cases, it was validated by running 
the equal probability case for which the solution is given by Sandel ius' 
theorem stated ii Section Ii. For these cases the EN) vaiue is 
accurate to at least four decimal places. 

The solution given by the computer program does minimize E(N), but 
does not indicate the existence of alternate optima. Such alternate 
solutions do in fact exist for the equal probability case. Thus, 


where the pharse "the dynamic programming solution" appears, perhaps 


the phrase "a dynamic programming solution" may be more appropriate. 
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EXAMPLE CASE | 


Procedure ey) ue Nnax 

Dynamic Prog 4.4000 0.2400 5 

Info Theory 4.4000 0.2400 5 

Seq Halving 4.4000 0.2400 5 
Pj 
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0 1, Comp 
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DYNAMIC PROGRAMMING 





Test Stare 4 6pec Comp pint 
eco 10 0 
Z eel 5 
PieZ0 15 
3 ie) 2 0 
6,10 7 
ile. | 5 2 
16,20 17 
4 lem 2 ] bene .6000 
3) ae 5 S 
Ehe/ 6 Gar 
8,10 8 8 
be. 12 1] alge 
15 is 13 
io. 17 Ge lOs iy 
18,20 18 18 
5 a. 5 4 5 §) . 4000 
9,10 9 SNe) 
ie 1 eens 
12520 19 19,20 
Figure 5 
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VI. CONCLUSIONS 


A. MODEL APPLICABILITY 
The assumptions that the system components are in series, and that 
exactly one failure occurs are not as restrictive as they appear at first 
glance. 
1. Series-Parallel Systems 
In many instances a complex system consisting of series and 
parallel networks can be reduced to an equivalent series system. 


Consider the system shown in Figure 20. Let R be the a priori 








Series-Parallel System 


Figure 20 


reliability of component i for i=1,2,...,11. Suppose the components 
work or fail independently of each other. 


The reliability of an n-component series system is 


n 
i = eee ees 
S cai 4 








Since the probability the system works is equivalent to the probability 

that all components work. This is commonly called the "product rule". 
Since (1-R.) is the probability component i is failed, 

I (1-R, ) is the probability that all components are failed. All 

nents of an n-component parallel system must fail to cause system 


failure, thus 


is the reliability of a parallel system. 

With the above expressions for Re and Ry the system in 
Figure 20 may now be reduced to an equivalent series system. First, 
components 2 and 3 form a parallel system. They can be replaced by an 


equivalent component I with 


Ry = |- (1-R,)(1-R3). 


Similarly, components 7 and 8 can be replaced by component IIA in 


series with component 5 with 
Rita =] - (1-R5)(1-Rg). 


Now, components 4, 6 and 9, and components 5 and IIA form two series 
Systems. Thus the reliabilities of the upper and lower branches of II 
are RaRERo and RERI TA respectively. These two branches form a 
parallel system, so they can be replaced by the equivalent component II 
with 


RY =] - (1-RAR-Rg) (I-RRi yp) 
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The system has now been reduced to an equivalent series system 
consisting of components 1, I, II, 10 and 1]. The model may now be 
employed to derive a test plan for the equivalent system. 

If the failure is located to lie in component I, both components 
2 and 3 are then known to be failed. Should component II _ be failed, 
its upper and lower branches are then treated as failed series systems 
for which the model is applicable. 

The above procedure to reduce the system to an equivalent series 
system requires a "grouping" of individual components which in turn 
constrains some properties of the total system test plan. Thus, the 
three solution procedures are not claimed to be optimal. 

2. Component Dependence 

The fact that some inter-dependence exists among the components 
Of most systems 1s not to be denied. However. one would expect such 
relationships to be difficult to define at best. It is therefore 
questionable whether the cost required for a correlation study of 
multiple failures would be justified in terms of improvement of results. 

Two or more components may be dependent to such a degree that 
the independence assumption is no longer an acceptable approximation. 

In this case one may still be able to convert the system to consist 

only of independent components. Each set of dependent components may 

be combined into one equivalent component. These equivalent components 
then operate independently in the system. This aggregation over 
dependent components creates a system with independent components, but 
also creates the above-mentioned "grouping" constraints on the test plan. 
This tradeoff is a factor in the determination of the appropriate level 


Of aggregation. 
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Once the components are assumed to function independently, even 
moderate system reliabilities yield small probabilities for multiple 


component failures. Consider the case in which the component reliabili- 


mires are all equal, 1.e., Ro = R fOV Si)... ..n- slencte Enesnumben 


of failures by X, and note that X is distributed Binomial (n,1-R). 


Then let 
P, = P{one failure|system down} = Be) 
y T-P(X=0) 


and, P{multiple failures|system down} = 1 - P, = Q 


System reliability, Re = Rp" = P(X=0). These quantities for various 


combinations of n and R_ are shown in Table III. This shows that, 


Example n R P, Q, R 


5 
] De Oss d 0450) 0. Cee? 
“ SS S/oe Us ee. [) cisis 
3 POPeO 30; C1, Imercie «0.60 
A TORSCr 275 J05So) OF ie Oar 


Conditional Failure Probabilities 
for Equal Reliability Case 


Table III 


for example one, the ratio of P, to 1-P, is nine to one. An 

increase in component reliabilities (examples one and two) increases 
this ratio. An increase in the number of components, maintaining the 
Same component reliability (examples one and three) reduces the ratio, 
but system reliability is also degraded. With an attendant increase 


in component reliability (examples one and four) to retain approximately 
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the same system reliability, the large ratio is preserved. System 
reliabilities of operational equipment are generally high. Thus the 
assumption of a single failed component seems reasonable. 
3. Constrained Maximum Number of Tests 

Should there exist a constraint on the maximum number of tests 
allowed, e.g., for budgetary or physical reasons, this model is not 
appropriate. The author understands that a dynamic programming formu- 
lation for this constrained problem has been developed, but that 
computer CPU time and core storage requirements limit its application 
to small systems. 

For larger systems then, this model can be used to generate 
three solutions from which to choose. Given the maximum number of 


tests n, select that solution from the three for which 


= —s se th AR TA a nt 
P(N < rn} ~ P(M=4 } ies Liew aaa GOS lao 


ta 


1=] 


B. MERITS OF THE INFORMATION THEORY PROCEDURE 
1. Expected Number of Tests Required 
For the criterion of minimizing E(N), the information theory 
solution is inferior to that generated by dynamic programming. The 
crucial question is whether the ease of computation of the information 
theory solution can offset its non-optimality. 


Let E(N), = E(N) given by dynamic programming, 


E(N). = E(N) given by information theory, and 
E(N). = E(N) given by sequential halving. 
E(N). - E(N), 
Define PD. = Ey (100) = = the percent degradation from the 


optimal solution, for i = 2,3. 
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Over the results of all 20-component example cases, the average 


PD, = 0.6%, with the maximum PD, = 2.5% on case 9. Over these same 


2 


cases, the average PD. = 9.6%, with the maximum PD. = 56.4% on case 2. 


S 
The 20-component cases for which the component reliabilities 

were selected randomly between 0.7 and 1.0 yielded the largest 

PD. values. Three 40-component cases with component reliabilities 

Similarly selected were also run. These results, summarized in Table IV, 

indicate that in terms of the E(N) criterion, the information theory 

Solution is quite good, at least for up to forty components. Further, 

the PD. values remained the same order of magnitude when the number 


of components was doubled from twenty to forty. 


Nr 


Case Comps fo2 3 
9 20 a8 625 
10 20 fa || O60 
1] 20 eal 4.9 
1S 40 ile 520 
16 40 Ls 5.0 
leh 40 iG 6.9 


Random (0.7-1.0) Case Results 
Table IV 


2. Maximum Number of Tests Required 


Of all example cases conducted, only in case 5 did ie new 


the information theory solution exceed that of the dynamic programming 
solution. In cases 6, 7 and 8 (all with component reliabilities 
symmetrically arranged), Lanes for the information theory solution was 
Strictly less than that of dynamic programming. For these cases, the 
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first decision d*(1,20) = 10 appeared to be primarily responsible 
for this result. Note that for symmetrically arranged component 
reliabilities, information theory will always split them in half on 
the first test. One may conjecture here that Nae for the information 
theory solution is likely to be less than or equal to that of dynamic 
programming. 

3. Computation Costs 

The computer program written for the dynamic programming solu- 
tion to the example cases is not claimed to be optimum in terms of the 
CPU time or the core storage required, yet it is believed to be repre- 
sentative of the CPU time and core storage requirements for this 
problem solution technique. 

To obtain an estimate of the relationship between CPU time 
Beeeeeccs and the numbar of camaamewis ia the svsteme Che Tinmagy 
Subroutine SETIME/GETIME was employed. Three cases each were run for 
10, 20 and 40 components and one 80-component case, all] with component 
reliabilities randomly selected from 0.7 to 1.0. The CPU times 


required for both calculations and output of results were recorded as 


shown in Table V. The time required for solution calculations heavily 


Nr Calculation Output 
Comps _Time(sec) _ Avg Time (sec Avg 
10 0.0346 0.0300 
10 0.0334 0.0340 0.0316 0 WE 
me . 90341 USS, | re 
20 0.2514 0.0592 
ZU 0.2410 0.2457 0.0543 0.0570 
_ a : ee UES) ee ae 
40 1) SINS 0.1234 
40 1.9657 1.9504 0.1191 OF 2a 
Relesege, 1.2520 ae. _ eer. Oslin Wee. 
80 14.5887 14.5887 0.2161 Ore 161 


CPU Time Requirenents 
Table Vy 








dominates the total time required as the number of components is 
increased. Further, as the number of components is doubled, the CPU 
time required for these calculations increases by a multiplicative 
factor of between seven and eight. Accordingly, projected CPU time 


requirements for larger cases are shown in Table VI. 


Nr Projected 
Comps Calculation Time 
160 102-117 sec. 
320 12-17 min. 
640 84-136 min. 


Projected CPU Time Requirements 


Table VI 


Core storage requirements and projected storage requirements 


for larger cases are shown in Table VII. 


Projected 
Nr Core Storage Nr Core Storage 
Comps (Bytes x 1000) Comps (Bytes x 1000) 
10 45.0 160 a5) 
20 47.5 sya) 868 
40 Oy ao 640 Sie 


80 96.5 
Core Storage Requirements 


Table VII 


As iS apparent from Tables VI and VII, core storage requirements 
would be the controlling factor in limiting the maximum number of 
components which could be considered in the dynamic programming computer 


program. 
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The information theory solution procedure is simple enough to 
require only basic desk calculator skill. Forty-component cases 
required about an hour and a half for the author on a desk calculator 
(not including the time required to calculate q-values). The practical 
limits of this solution method are estimated to be at about 100-160 
components. For systems substantially larger the author believes that 
a computer program could be developed which would require far less core 
Storage than the dynamic programming solution. Thus, for large systems, 
it may well be the case that information theory and sequential halving 


may provide the "best" alternative solutions. 
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APPENDIX A 


SYSTEM POSTERIOR PROBABILITIES 


A. STATEMENT OF THE PROBLEM 


Given the a priori probabilities that the components are working, 


find the corresponding posterior probabilities given the system has 


exactly one failed component. 


B. DERIVATION 


Define the following: 


Pp; = a priori probability component 7% 1S working. 


q; = 1 - ps =a priori probability component i is failed. 


a. 1 - q. = posterior probability component i is working given 


1 


the system has exactly one failed component. 


event. 


{A.} = the event component ji is working. {A} is the complementary 


{B} = the event exactly one component is failed. 


We wish to find Ps = P(A. |B), Seen: 


ES) 
io 
i 


H 
om 
J 
ar 
—s 
Cuse 


q qT p. 
w=1  K jdk J 
n 
ye 
i=] 


and we have that q. =]. 


P(A,|B) = 1 - P(A.|B) = 


1 


P(A. (\B) 
P(B) 








C. COMPUTATIONAL FORM 
To apply the information theory procedure we must first find 
q. for i = 1,2,...,n. We seek a form of (1) which is more suitable 


for hand calculations: 


O 0 O 0 O70, 20 
Gi iiaipe Gages Q=/ pele) 
;, ee ee eye ee 
rm " o O ©). oy SORE 
zg, Tp; = gy /Py (Ps) (Ips) 2 ap/Py 
eile ie k=] j jue HS) 
Oye. 
: q./P; 
eo (2) 
A Oy 
q 
w=1 *K 


D. ALGORITHM FOR COMPUTATION 
1. Calculate a°/ps for each component (a° =] - n°?) 
2. Sum results of step (1). 


3. Divide each qe /P; from step (1) by the result of step (2). 








APPENDIX B 
INFORMATION THEORY PROCEDURE 


A. DERIVATION 


Define the following: 


US = the uncertainty (entropy) present in an n-component system 
Known to have exactly one failed component. 

ue = the uncertainty remaining after one test across the first 
m components. 

a = posterior probability component ji is oS given the 
system has exactly one failed component. eo = ]. 

i* = the number of the failed component. i 


Khinchin[14] offers the following rationale for the uncertainty 
(entropy) concept in probability theory. A complete system of events 
is a mutually exclusive and exhaustive set of events Ay sAos++ oA. 
Given the events of a complete system, together with their probabilities 
of occurrence q).q9,---.9, (4; > 0, 2 q. = 1), we have a finite 
scheme. For example, one toss of a true die yields the finite scheme 
{A,,...sAgh with A. = {die shows i}, hence qs = 1/6 tor 1c ae 
Any finite scheme describes a state of uncertainty in that we don't know 
which of the events A AL will be ae 
ae eG 


i=] 
the uncertainty inherent in a finite scheme, Ays++sAs taking 


por 


Define the quantity Ul) o++ +94 In q. as a measure of 


n j 
q. In q. = 0 if q. = QO. In our problem, A. = {component i is failed}, 
n 
hence US = - Pais Tn q. 
We wish to maximize the expected reduction in uncertainty at each 


mest. tIhus, the problem is: 
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Se Pe (ed ler ee | 


E(U, - U.) = ae - E(U_) = Un - ECU.) , since Uo is a constant. 
E(U_) = P(Isi*<m)E(U,[1<i*<m) + P(m+1<i*<n)E(U,, |m+1<i*<n) 
m m q. q 
= y q (-1) 2 an In mations + 
=] =] 
ed ; 4 
1=] i=l 
n n q. q 
> q. (-1) y — In a tle 
j=m+1 j=mt+l\ - 
eee iy cau 
1=m+] 1=m+] 
n m m q q 
So E(U. - U.) =. J q. In Cees eee = lies == In = + 
i=] i=] sllliaes 2, 7 q 
fey) 
n n q. q 
| y | (ele saath In = 
emt] raat, 
i=m j=m rq, rq, 
j=m+] 1=m+] 
n m q n q 
aoe - | ) == F Gg. Ing. + £ gq.ln —J—\+ 5  q.In/—+— 
0 m jay i eae m fad n 


is! 

m q: n qn 
coe = 2 gq. In ——__| + ee Coen ——~s___ 
Wel 2 j=mt] 2 i 
2. ds ei: 

el j=m+] 
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m m n n 
Bt, OQ = 2 g.| IN G2 ie ce = y. - Gc eG ot Lec: 
Oe eee iat! j=m+1 9 : i=m+] | 


n m m n n 
SB Qe WMeGs = 2G) Vi Gi ec ei ee 
(i er ie jemt] 4 \i=me1 


n 
Sdain, Since 2 "qj In q; is a constant, we can reduce the problem to 
Jz 
' m m n n 
fax QO =- © gq. Inf 2 Gq.) = = q. in a (1) 
" ie © isi jemt1 2 i=mt1 | 
s.t. m= 1,2,...,n-1 
m 
ieee = Pii<i*<m) = 2 q5° Expression (1) is of the form: 
JF 


- a In oe - (1 - i) In(1 - Pale Further, is is an increasing function 
of m. 

Consider the function 

fx) = -xin x - (1 - x)In(i - x), O< x < 1. 
Let us maximize f(x) for O< x < 1. 


f'(x) = -In x - 1 - [-In (1-x) - 1] 


I 
© 


= -In x + In (1-x) = 0 


= iy (%) = 02x = 1-x =x = 1/2 


Further, f"(x) = - i ~~ < On ele 0 ee ss Ie 


Thus f(x) is concave over (0,1) with a single maximum at x = 1/2. 
From the above concavity argument and the fact that f(x) is sym- 

metric about x = 1/2 , it follows that in order to maximize Q., and 

consequently to maximize the expected reduction in uncertainty, one 


m 
must choose m such that | & q; - 1/2| is a minimum. 
i 
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The problem may be formulated in another way which leads to the 
same result. One may consider the amount of information given by the 
realization of a finite scheme to be equal to the entropy of the scheme 
[14]. Any test defines a finite scheme with two events, A, = 
{the test fails} and its complement A, = {the test passes}. The 


information gained from such a test is then 


lo -P(A, )In P(A, ) - P(A, )In P(A,). 


But P(A, ) = |= P(A.) SO 


U = - P(A, )In P(A, ) - (1 - P(A) In (] - P(A,)). 
m 
As developed above, P(A, ) eo) Ge when testing across the first Mm 


1=] 
components. Maximization of U subject to m=1,2,...,n-] is equivalent 
to expression (1), which now has the interpretation of maximizina the 
information gained by this test and hence leads to the same result as 


above. 


Bee APPLICATION TO SUCCEEDING TESTS 

The above derivation applies for the first test. In all subsequent 
tests, the q-values for the components must be modified to reflect the 
additional knowledge obtained on the system state. Specifically, we 
want qp , the probability that component k is failed given the 


@eeemeis in state S(i,j); for i<k <j. Note that 


q q 
% * a =e, J Ss odie 
PESGIsS)I sg 
ea 
h=1 
° J r 
which is a normalization of a such that 2% ah, 1 
k=] 


6] 








This yields a “system" consisting of components 71 through j_ known 
to contain the defective component. Now, in order to maximize the 
m 
expected reduction in uncertainty, choose m such that | & ‘, S72) 
h=1 


iS a minimum. 


C. ALGORITHM FOR COMPUTATION 
The following steps constitute the calculations necessary for test 
plan specification: 


1. Calculate component posterior probabilities of failure, qs 


for i=1,2,...,n, using the algorithm of Appendix A. 


2. State S(1,n). Form the partial sums 


sequentially for k=1,2,..., until Z(k) > 0.5 for the first time. Let 
eis k = KO Now AC < 1/2 < Z(k), and d*(],n) = KOT or KO? 
according to the following: 

Kol if (a eli, = 1/2| < [Z(k) = 1/2| 


d*(],n) = 


ope i eC a mm) => 172\(( 0) ite) 


The first of these conditions becomes 


1/2 - Zk -1) < Z(k,) - I/2 


ilk) ae 


Pal) aa 


and the second condition becomes 


1/2 - Z(k.-1) > Z(k,) - 1/2 


Z(k.) ri Z(k,-1) < ] 
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so, choose d*(1,n) according to the following: 


keel it 32) Se ie 
d*(1,n) 2 0 0 0 = 


k iat Z(k,) 1p Z(k -1) el 


Oi 


3. Resulting state determination. If at test t, d*(i,j) = k, 
then the two possible resulting states at test t+l are S(i,k) and 
Bhichl,j). 

4, State S(i,j). Form the partial sums 

K 


H(k) = 
h=i 


, 
Sh 
Bequentially for k=i,i+1,..., until Z(k) > 0.5 for the first time. 
Let this k= k . As shown above, choose d*(i,j) according to the 
following: 

Ka > 


joe eh 


lv 


eh 37 Tate 
BaGig) =< ~ 
k Tas Z(ko) “ eels) 


A“ 
— 


a: 
5. Located failed components. Any test plan must include test 
sequences to locate each component, should that component be the 
defective. If d*(i,j) = i, component i is located. If d*(i,j) = j-l, 
component j is located. Note that d*(i,i+1) = i always and locates 
both components ji and itl. 
6. Continue steps three through five until all components are 
located. This completes the test plan specification. Brulé, Johnson 
and Kletsky[3] have shown that exactly n-1 tests must be specified 
to complete the test plan. 
7. Expected number of tests required. If a test plan requires a 


maximum of Bs tests, then 
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to 
E(N) = x i-P(N=i), 
=] 


: 
where P(N=i) is the sum of the component q-values for all components 
requiring exactly 7. tests. 


8. Variance of number of tests required. 


V(N) = E(N°) - [E(N)I° 


i°-P(N=i) - [E(N)]¢ 


i 
it 1 cr 


; 
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APPENDIX C 
PROPERTIES OF THE SEQUENTIAL HALVING PROCEDURE 


A. MAXIMUM NUMBER OF TESTS REQUIRED 


Let m= the cardinality of the set known to contain the failed 


component (the defective set) after t tests have been conducted. 


Express m, in the form m= okt Pe where K, eae. lke teoand 


the remainder term, r, € ROR One - 1)}. Initially, there are 


i= m= oko + ry, components to be tested. 


Let me be the maximum number of components which could be in 


the defective set after the pth test. The pth test partitions 


the m,_, set into two subsets containing m, and m, components 


respectively, such that m, +m) =m,_, and me = max (my my). 
Let N(Il) be the maximum number of tests which nay be required to 
locate the failed component under any sequential testing procedure Il. 
Let d. be the number of components to be tested across on the 
pen Gest. 
Define the sequential halving procedure II* as follows: 
If m, = 2°t +r, then divide in the middle so that 
da) = okt-1) + [r,/2], where [r,/2] is the greatest integer less 
than or equal to r/e. 
Lemma 1. For a system of n=m, = DO & r, components, the 


sequential halving procedure II* yields 


" 
© 


k alle Ont 
0 


a k 
Ko ee eels es lg 2eeres (2eOlw lk) 
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Proof. Case I (r, = 0) Ws ako Under II*, d, = akon! and 
m) = oXo-! For the second test d5 = oko~e and om, = oko-2 For 
the third test d, = ako-3 and Ma = 9Xo-3 continue this procedure 


~ 99 _ z 
Tee 2 = 1] and Mk ills 
Testing is now complete, since the one component remaining must in fact 


until om, 4 = aKo-{ko-!) 25 ellie © 
O 


be faulty. Therefore, ky tests are required and N(II*) = ko: 


Case I] Ce #0): n=m.= ako tie where now ry & {1,2,...,(2%0 - 1)}. 


0 
acl 


Under I, d) = 2°07! + Er 2], [r./2] © {0,1,...,(2807! - 1)}. This 


test may divide the components into two groups with pon! + a 


components in one group. and DOr a 2 + 1 components in the other 
ka-1 


it i is odd, or two groups both with 2-9 + r/e components if 


on is even. In the worse case, 


k= 
k= O 
ma 2 + r/2 + | 
Alene] : oS fa Ped y. 
Suppose ry = oko"! then my = DSO DSO Ves, D0) 


Under case I, it is shown that Ko further tests are required under 
I*, yielding N(iI*) = & ole 


Suppose alternatively that ry ot Ya recon Oe 


Ko-l ry. and under II*, d., = 2 ae uF [r,/2], 


[r,/2] Bios... (2 0 


a 
ma 2 


1)} and 


m& = 2Ko? 4 [r,/2] + ] 


2 
eee 


kqg-2 
2 + 15 5 To Giles were Zu come Ne 


Kom? Then ms = 28072 + pKo-2 = pKonl 


Suppose Noe= 2 5 


Under case I it is shown that aa further tests are required under 


M*, yielding N(l*) = ky ee 








Suppose alternatively that rp e012, ..0(2%0r Se Then 


ko~2 k 


-3 
= — 0 
ms 2 + 19, and under II*, d. 2 + Lro/2], 


[r,/2] « (0,1, 00a, (20s =W)eand 


oad Ko-3 
ma = 2 + [r,/2] + ] 


Ko"S 4 i RRO”) . 


ee 


Continue this procedure until 


- 9ko-ko 
MK 2 + bry 1/21 al 


» TY € | A psa log 
0 0 


_ 40 
we 


HY 
—f 
+ 
—i 

I 
Ro 


Then “kot = 1] and me 47 7 1. As in case I above, testing is now 


complete and N(I*) =k +17, @. F, OD, 


B. MINIMAX PROPERTY 
Theorem 1. Of all dichotomous sequential testing procedures MIL, 


the sequential halving procedure II* 1s a minimax procedure. That is, 


N(I*) = eh N(I1) 


Proof. the theorem is true vacuously for n = 1, since no testing 
Heerequired. It is also true for n= 2 and n= 3, since I* jis 
the only dichotomous sequential testing procedure to follow. Note here 
miat for n= 3, dj = 2 (testing across the first two components) is 
equivalent to testing across the third component which satisfies II*. 
Assume the theorem to be true for n = 1,2,...,(s-1). To show: 


The theorem holds for n= s. 
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let n=s5s = oko ae ey eee f),..) OL . Any procedure II 
reduces at each test the number of components in the defective set by 
at least 1. So for any procedure Il, my <S = leet 1s assumede unas 
the theorem holds for n= 1,2,...,(s-1). Thus M* is the minimax 
procedure to follow after conducting the first test. Therefore, we 
need consider those procedures II' which differ from IJII* only at the 


first test. 


Case ] We = 0): s=m = oko, 


F Under any procedure II', 
ky-| Ko_ 


2 cee Min < C 


] 
(by Lemma 1): N(II') = kt > kK, = N(II*). 


1. Employing Il* for subsequent tests yields 


2 . 7ko Ko. 
Case II ca a (U)\es Bes mM 2-0 + emerge. cee eZee ce On in 


Under any procedure II', 


Ko-l 


2 + [r,/2] ser Gin me < 2Ko uaee eel: 


where r, € (1,2,... (280-1) }=> [rn /2] oar Ci Cs Be 


k eal 


oo! 4 [r./2] + 2> 2°"! + 2, for all r., and 


ako + i - ] Sao, TOR ll ee hus. for a #0 there exists no 
procedure II' for which ma < axon! + 2. Therefore, employing II* 

on subsequent tests yields N(II') = roll = N(I*) for those procedures 
for which my @ 2S, and N(II') = k +2 > kt = N(II*) for those 


procedures for which ma > aKa. Ol. +24. 1B: 


C. COUNTEREXAMPLE TO MINIMAX PROPERTIES OF "HALF-SPLIT TECHNIQUE" 
As cited in Section II, Brulé, Johnson and Kletsky [3] make two 
claims involving minimax properties of the "half-split technique”. 
They claim that it minimizes the maximum possible cost in both the 
equal cost-equal probability and the equal cost-unequal probability 


Cases. 








Since the costs are equal for all tests in both cases, we may let 
the cost of a test be unity. Then the maximum possible cost of a 
test plan is simply the maximum number of tests which may be required. 
Let the system consist of 15 components. The "“half-split technique" 
Brescribes that if m, = okt + r, components remain in the defective 


i IE 


set after t tests, t= 0,1,2,..., then the (t+1)°° 


test must 
partition these components into two groups such that each group 
contains at least se components. One may generate the test plan 
Shown in Figure 21, which follows this procedure. Under this plan the 
maximum possible cost is 6. From Lemma 1 above, we know that employing 
the sequential halving procedure will lead to a maximum possible cost 
of 4. Thus the above solution is not minimax. Note also that a 


minimax criterion 1s independent of component reliabilities. 
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"Half-Split Technique" Test Plan 


Figure 2] 
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